An Exploratory Study On The Appropriateness Of Latent Dirichlet Allocation For Automatic Discovery Of Product Associations From User-Generated Content

Authors

  • Johannes Putzke
  • Kai Fischbach
  • Detlef Schoder
Abstract

Latent Dirichlet Allocation (LDA) is a method that can be used to generate word association networks from unstructured text documents. However, no study has yet examined the applicability of LDA for deriving product associations from user-generated content. In this work, we apply LDA to 9,529 unstructured and uncategorized McDonald’s product reviews crawled from a German online review platform. To evaluate its applicability, we surveyed 95 Information Systems undergraduate students about their associations with 17 McDonald’s-related nouns. Results indicate that LDA is a valid method for deriving product associations from user-generated content.

Similar resources

Sex, drugs, and violence

Automatically detecting inappropriate content can be a difficult NLP task, requiring understanding context and innuendo, not just identifying specific keywords. Due to the large quantity of online user-generated content, automatic detection is becoming increasingly necessary. We take a largely unsupervised approach using a large corpus of narratives from a community-based self-publishing websit...

Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation

Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...

Using Latent Topic Features for Named Entity Extraction in Search Queries

Search is one of the fastest-growing applications in the mobile market. As people rely more on portable devices for performing search, it becomes increasingly important to analyze user queries in order to achieve more targeted results over a broad set of search entities. While most previous work has relied on lexico-syntactic features and handcrafted knowledge sources, this paper investig...

An Approach to Discovery and Re-ranking of Educational content from the World Wide Web using Latent Dirichlet Allocation

With the tremendous increase in the amount of digital data available, educators are forced to author content for learning and teaching for use in their classes. With that, a need has emerged to facilitate automatic discovery of learning resources from the World Wide Web. In this work, we present a novel approach for discovering content from the web for e-learning. We argue that for an e-learnin...

Exploring Latent Semantic Factors to Find Useful Product Reviews

Online reviews provided by consumers are a valuable asset for e-Commerce platforms, influencing potential consumers in making purchasing decisions. However, these reviews are of varying quality, with the useful ones buried deep within a heap of non-informative reviews. In this work, we attempt to automatically identify review quality in terms of its helpfulness to the end consumers. In contrast...

Publication date: 2013